NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Genomic and transcriptomic perspectives on the origin and evolution of NUMTs in Orthoptera

https://doi.org/10.1016/j.ympev.2024.108221

Liu, Xuanzeng; Liu, Nian; Jing, Xuan; Khan, Hashim; Yang, Kaiyan; Zheng, Yanna; Nie, Yimeng; Song, Hojun; Huang, Yuan (December 2024, Molecular Phylogenetics and Evolution)

Full Text Available
Review on functional data classification

https://doi.org/10.1002/wics.1638

Wang, Shuoyang; Huang, Yuan; Cao, Guanqun (January 2024, WIREs Computational Statistics)

Abstract A fundamental problem in functional data analysis is to classify a functional observation based on training data. The application of functional data classification has gained immense popularity and utility across a wide array of disciplines, encompassing biology, engineering, environmental science, medical science, neurology, social science, and beyond. The phenomenal growth of the application of functional data classification indicates the urgent need for a systematic approach to develop efficient classification methods and scalable algorithmic implementations. Therefore, we here conduct a comprehensive review of classification methods for functional data. The review aims to bridge the gap between the functional data analysis community and the machine learning community, and to intrigue new principles for functional data classification. This article is categorized under:Statistical Learning and Exploratory Methods of the Data Sciences > Clustering and ClassificationStatistical Models > Classification ModelsData: Types and Structure > Time Series, Stochastic Processes, and Functional Data
more » « less
Full Text Available
Functional data analysis using deep neural networks

https://doi.org/10.1002/wics.70001

Wang, Shuoyang; Zhang, Wanyu; Cao, Guanqun; Huang, Yuan (July 2024, WIREs Computational Statistics)

Abstract Functional data analysis is an evolving field focused on analyzing data that reveals insights into curves, surfaces, or entities within a continuous domain. This type of data is typically distinguished by the inherent dependence and smoothness observed within each data curve. Traditional functional data analysis approaches have predominantly relied on linear models, which, while foundational, often fall short in capturing the intricate, nonlinear relationships within the data. This paper seeks to bridge this gap by reviewing the integration of deep neural networks into functional data analysis. Deep neural networks present a transformative approach to navigating these complexities, excelling particularly in high‐dimensional spaces and demonstrating unparalleled flexibility in managing diverse data constructs. This review aims to advance functional data regression, classification, and representation by integrating deep neural networks with functional data analysis, fostering a harmonious and synergistic union between these two fields. The remarkable ability of deep neural networks to adeptly navigate the intricate functional data highlights a wealth of opportunities for ongoing exploration and research across various interdisciplinary areas. This article is categorized under:Data: Types and Structure > Time Series, Stochastic Processes, and Functional DataStatistical Learning and Exploratory Methods of the Data Sciences > Deep LearningStatistical Learning and Exploratory Methods of the Data Sciences > Neural Networks
more » « less
Full Text Available
High-dimensional causal mediation analysis based on partial linear structural equation models

https://doi.org/10.1016/j.csda.2022.107501

Cai, Xizhen; Zhu, Yeying; Huang, Yuan; Ghosh, Debashis (October 2022, Computational Statistics & Data Analysis)

Full Text Available
Bayesian hierarchical finite mixture of regression for histopathological imaging‐based cancer data analysis

https://doi.org/10.1002/sim.9309

Im, Yunju; Huang, Yuan; Huang, Jian; Ma, Shuangge (March 2022, Statistics in Medicine)

Full Text Available
An overview of tests on high-dimensional means

https://doi.org/10.1016/j.jmva.2021.104813

Huang, Yuan; Li, Changcheng; Li, Runze; Yang, Songshan (March 2022, Journal of Multivariate Analysis)

Full Text Available
Van der Waals heterostructures

https://doi.org/10.1038/s43586-022-00139-1

Castellanos-Gomez, Andres; Duan, Xiangfeng; Fei, Zhe; Gutierrez, Humberto Rodriguez; Huang, Yuan; Huang, Xinyu; Quereda, Jorge; Qian, Qi; Sutter, Eli; Sutter, Peter (December 2022, Nature Reviews Methods Primers)

Full Text Available
Bayesian finite mixture of regression analysis for cancer based on histopathological imaging–environment interactions

https://doi.org/10.1093/biostatistics/kxab038

Im, Yunju; Huang, Yuan; Tan, Aixin; Ma, Shuangge (November 2021, Biostatistics)

Summary Cancer is a heterogeneous disease. Finite mixture of regression (FMR)—as an important heterogeneity analysis technique when an outcome variable is present—has been extensively employed in cancer research, revealing important differences in the associations between a cancer outcome/phenotype and covariates. Cancer FMR analysis has been based on clinical, demographic, and omics variables. A relatively recent and alternative source of data comes from histopathological images. Histopathological images have been long used for cancer diagnosis and staging. Recently, it has been shown that high-dimensional histopathological image features, which are extracted using automated digital image processing pipelines, are effective for modeling cancer outcomes/phenotypes. Histopathological imaging–environment interaction analysis has been further developed to expand the scope of cancer modeling and histopathological imaging-based analysis. Motivated by the significance of cancer FMR analysis and a still strong demand for more effective methods, in this article, we take the natural next step and conduct cancer FMR analysis based on models that incorporate low-dimensional clinical/demographic/environmental variables, high-dimensional imaging features, as well as their interactions. Complementary to many of the existing studies, we develop a Bayesian approach for accommodating high dimensionality, screening out noises, identifying signals, and respecting the “main effects, interactions” variable selection hierarchy. An effective computational algorithm is developed, and simulation shows advantageous performance of the proposed approach. The analysis of The Cancer Genome Atlas data on lung squamous cell cancer leads to interesting findings different from the alternative approaches.
more » « less
Species-specific partial gene duplication in Arabidopsis thaliana evolved novel phenotypic effects on morphological traits under strong positive selection

https://doi.org/10.1093/plcell/koab291

Huang, Yuan; Chen, Jiahui; Dong, Chuan; Sosa, Dylan; Xia, Shengqian; Ouyang, Yidan; Fan, Chuanzhu; Li, Dezhu; Mortola, Emily; Long, Manyuan; et al (December 2021, The Plant Cell)

Abstract Gene duplication is increasingly recognized as an important mechanism for the origination of new genes, as revealed by comparative genomic analysis. However, how new duplicate genes contribute to phenotypic evolution remains largely unknown, especially in plants. Here, we identified the new gene EXOV, derived from a partial gene duplication of its parental gene EXOVL in Arabidopsis thaliana. EXOV is a species-specific gene that originated within the last 3.5 million years and shows strong signals of positive selection. Unexpectedly, RNA-sequencing analyses revealed that, despite its young age, EXOV has acquired many novel direct and indirect interactions in which the parental gene does not engage. This observation is consistent with the high, selection-driven substitution rate of its encoded protein, in contrast to the slowly evolving EXOVL, suggesting an important role for EXOV in phenotypic evolution. We observed significant differentiation of morphological changes for all phenotypes assessed in genome-edited and T-DNA insertional single mutants and in double T-DNA insertion mutants in EXOV and EXOVL. We discovered a substantial divergence of phenotypic effects by principal component analyses, suggesting neofunctionalization of the new gene. These results reveal a young gene that plays critical roles in biological processes that underlie morphological evolution in A. thaliana.
more » « less
Feature screening in ultrahigh-dimensional varying-coefficient Cox model

https://doi.org/10.1016/j.jmva.2018.12.009

Yang, Guangren; Zhang, Ling; Li, Runze; Huang, Yuan (May 2019, Journal of Multivariate Analysis)

Full Text Available

« Prev Next »

Search for: All records